Statistical Sentence Chunking Using Map Reduce
Similar resources
Realization of long sentences using chunking
We propose sentence chunking as a way to reduce the time and memory costs of realization of long sentences. During chunking we divide the semantic representation of a sentence into smaller components which can be processed and recombined without loss of information. Our meaning representation of choice is Dependency Minimal Recursion Semantics (DMRS). We show that realizing chunks of a sentence...
Bitext Alignment for Statistical Machine Translation
Bitext alignment is the task of finding translation equivalence between documents in two languages, collections of which are commonly known as bitext. This dissertation addresses the problems of statistical alignment at various granularities from sentence to word with the goal of creating Statistical Machine Translation (SMT) systems. SMT systems are statistical pattern processors based on para...
Japanese Dependency Analysis using Cascaded Chunking
In this paper, we propose a new statistical Japanese dependency parser using a cascaded chunking model. Conventional Japanese statistical dependency parsers are mainly based on a probabilistic model, which is not always efficient or scalable. We propose a new method that is simple and efficient, since it parses a sentence deterministically only deciding whether the current segment modifies the ...
Working memory and binding in sentence recall
doi:10.1016/j.jml.2009.05.004 (corresponding author: G.J. Hitch). A series of experiments explored whether chunking in short-term memory for verbal materials depends on attentionally limited executive processes. Secondary tasks were used to disrupt components of working memory and chunking wa...
Two-Level Metadata Management for Data Deduplication System
Data deduplication is an essential technique for reducing storage space requirements. Chunking-based data deduplication is especially effective for backup workloads, which tend to consist of files that evolve slowly, mainly through small changes and additions. In this paper, we introduce a novel data deduplication scheme that can be used efficiently and quickly over low-bandwidth networks. The key p...